(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
27 November 2003 (27.11.2003) 




PCT 




(10) International Publication Number 

WO 03/097794 A2 



(51) International Patent Classification 7 : 



C12N 



(21) International Application Number: PCT/US03/1449 1 



(22) International Filing Date: 
(25) Filing Language: 



(26) Publication Language: 



7 May 2003 (07.05.2003) 

English 
English 



(30) Priority Data: 

10/151,061 



16 May 2002 (16.05.2002) US 



(71) Applicant (for all designated States except US): AP- 
PLERA CORPORATION [US/US]; 850 Lincoln Centre 
Drive, Foster City, CA 94404 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): LAO, Kai, Qin 

[US/US]; 6384 Farm Hill Way, San Jose, CA 95120 
(US). CHEN, Caifu [CN/US]; 777 San Antonio Road, 
#91, Palo Alto, CA 94303 (US). KOEHLER, Ryan, T. 
[US/US]; 3175 Cromwell, Hay ward, CA 94542 (US). 
SCAFE, Charles [US/US]; 1111 Carolina Street, San 
Francisco, CA 94107 (US). SCHROTH, Gary [US/US]; 
100 Monasterio Court, San Ramon, CA 94583 (US). 

(74) Agent: HUNT, Dale, C; KNOBBE, MARTENS, OLSON 
& BEAR, LLP, 2040 Main Street, 14th Floor, Irvine, CA 
92614 (US). 



(81) Designated States (national): AE, AG, AL, AM, AT (util- 
ity model), AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, 
CH, CN, CO, CR, CU, CZ (utility model), CZ, DE (util- 
ity model), DE, DK (utility model), DK, DM, DZ, EC, EE 
(utility model), EE, ES, FT (utility model), FT, GB, GD, GE, 
GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, 
MW, MX, MZ, NI, NO, NZ, OM, PH, PL, PT, RO, RU, 
SC, SD, SE, SG, SK (utility model), SK, SL, TJ, TM, TN, 
TR, TT, TZ, UA, UG, US, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FT, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Declaration under Rule 4.17: 

of inventorship (Rule 4.17 (iv)) for US only 

Published: 

without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(54) Title: UNIVERSAL-TAGGED OLIGONUCLEOTIDE PRIMERS AND METHODS OF USE 



< 

o 

2 
o 



universal-tagged oligonucleotide primer 






specific 




random 




















♦ 
t 
i 


i 





target 

(57) Abstract: The present invention relates to universal-tagged oligonucleotide primers, and to methods of using the primers for 
amplifying the genome. 



WO 03/097794 



PCT/US03/14491 



UNIVERSAL-TAGGED OLIGONUCLEOTIDE PRIMERS AND METHODS OF USE 
Field of the Invention 

[0001] The present invention relates to universal-tagged oligonucleotide primers, 
and to methods of using the primers for amplifying the genome. 
Introduction 

[0002] Whole genome amplification (WGA) is a valuable technique for 
amplification of the genome from minimal or limiting amounts of DNA for subsequent molecular 
genetic analysis. It is desirable that whole genome amplification is conducted to ensure that the 
amplification is not biased, meaning that all sequences in a sample should be amplified to the 
same extent. 

[0003] Whole genome amplification may be performed using either conventional or 
nonconventional PGR amplification methods. Conventional PCR entails the amplification and 
subsequent detection of specific DNA sequences which are precisely characterized in length and 
sequence using nondegenerate primers, while random, "non-conventional" PCR involves 
universal amplification of prevailing DNA or amplification of unknown intervening sequences 
which are not generally defined in length or sequence using degenerate primers. 

[0004] The use of specific primers to amplify genomic DNA (gDNA) is not 
practical or economical for multiplex applications, such as analyzing single-nucleotide 
polymorphisms (SNPs), while the use of random primers to amplify gDNA is typically not 
reproducible or predictable due to non-specific priming. 

Summary of the Invention 

[0005] In one aspect, the invention provides a method for amplifying target DNA 
comprising multiple DNA sequences by polymerase chain reaction (PCR) by preparing a mixture 
of (1) the target DNA, (2) a set of single-stranded oligonucleotide primers, each primer 
comprising (i) a 3' specific region, (ii) a random region, and (iii) a 5' universal region, wherein 
the universal region serves as a first universal priming site (Ul) for further PCR amplification 
with a complementary primer, (3) a DNA polymerase, and (4) multiple deoxynucleoside 
triphosphates (dNTPs) under conditions such that the PCR primers anneal to and prepare a copy 
of the multiple DNA sequences by primer extension of the target DNA, yielding an amplified 
product retaining the first universal priming site (Ul). The target DNA amplified by this method 
may be, without limitation, gDNA or cDNA, and may include at least a fraction of the whole 
human genome. If desired, the amplified product retaining the Ul universal priming site may be 
further extended to encode a second universal priming site (U2) 5 and further amplified with two 
primers complementing the first and second universal priming sites (Ul and U2). In one 
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embodiment, the DNA polymerase for amplifying the target DNA is a Taq DNA polymerase, 
such as, for example, AmpliTaq Gold™ DNA polymerase, AmpliTaq™ DNA polymerase, or the 
Stoffel fragment of AmpliTaq™ DNA polymerase. If desired, the method of amplifying target 
DNA may include subjecting the amplified product to single-nucleotide polymorphism (SNP) 
genotyping. In a particular embodiment, the set of oligonucleotide primers used in the 
amplification reaction is designed based upon bioinformatic prediction of expected products, 
using in silico "e-PCR." 

[0006] In another aspect, the invention provides a universal-tagged single-stranded 
oligonucleotide primer for polymerase chain reaction (PGR), comprising, (1) a 3' specific region, 

(2) a random region, and (3) a 5' universal region. 

[0007] In yet another aspect, the invention provides a set of single-stranded 
oligonucleotide primers for amplification of genomic DNA (gDNA) in a polymerase chain 
reaction (PCR), each of the primers comprising (1) a 3' specific region, (2) a random region, and 

(3) a 5' universal region. 

[0008] In a further aspect, the invention provides a primer-target duplex that forms 
between the binding region of the single-stranded oligonucleotide primer and the target DNA. 

[0009] In all aspects of the invention, the 3 ' specific region of the oligonucleotide 
primers may, for example, be about 4 to 12 bases in length. In another embodiment, the 3 5 
specific region may be designed to bind to a genomic sequence occurring in the human genome 
with a frequency of about 0.01% and 2.00%. In a further embodiment, the random region of the 
primers may be between about 2 and 1 5 bases in length. In primers with a random region of 4 
bases in length, the sequence of the random region may include all 256 possible combinations of 
adenosine, cytosine, guanine, and thymidine. In a still further embodiment, the 5' universal 
region of the primers may be designed to have no significant homology to any segment in the 
human genome and further to be between about 12 and 35 bases in length. 

Brief Description of the Drawings 

[0010] Figure 1 shows 5-mers that may serve as a specific region in a single- 
stranded oligonucleotide primer and their frequency of occurrence in the entire human genome. 

[0011] Figure 2 is a schematic of PCR amplification of the target COX6b DNA 
sequence using two single-stranded oligonucleotide primers, each having a 3 ' specific region of 5 
bases in length (5 5 -GCTCG), a random region of 4 bases in length (5 5 -NNNN), and a 5 ? universal 
region (Ul universal primer site or U2 universal primer site) to incorporate a universal Ul and a 
universal U2 priming site into a PCR-generated copy of the target COX6b DNA sequence. 

[0012] Figure 3 is a schematic of the use of Ul and U2 primers to further amplify a 
PCR-generated copy of target DNA containing a universal Ul and universal U2 priming site. 
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[0013] Figure 4 is a schematic of PGR amplification using locked nucleic acid 
(LNA) substituted single-stranded oligonucleotide primers comprising a Y specific region (5'~ 
TTTtTtTtT) and a 5 ? universal region that may serve as a universal priming site for further PCR 
amplification. LNA residues are shown in capital letters, e.g. T, while normal nucleic acids are 
shown in lower case letters, e.g. t. 

[0014] Figure 5 shows a region of the nucleotide sequence of the COX6 gene (SEQ 
ID NO: 35) with sequences underlined to indicate sites for annealing of primers and probes. The 
sequence of primers and probes that anneal to the COX6 gene are shown in capital letters directly 
above the target sequence. 

[0015] Figure 6 shows the Tm calculation for the formation of duplexes between 

target DNA and primers either with or without a universal tag. 

[0016] Figure 7 represents a schematic of the formation of a primer-target 

duplex between a universal-tagged primer and the target DNA. Vertical solid lines are used to 
denote complete hydrogen bonding between bases in the specific region of the primer and the 
target while vertical hatched lines are used to denote less than complete hydrogen bonding 
between all the bases in the random region of the primer and the target. 

Detailed Description 

A. Definitions 

[0017] Unless defined otherwise, technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. One skilled in the art will recognize many methods and materials similar or equivalent 
to those described herein, which could be used in the practice of the present invention. Indeed, 
the present invention is in no way limited to the methods and materials described. For purposes 
of the present invention, the following terms are defined below. 

[0018] "COX6b" when used herein refers to subunit 6b of human cytochrome c 
oxidase as disclosed, for example, in Taanman et aL, Nucleic Acids Res., 17(4): 1766 (1989) and 
Carrero-Valenzuela et al, Gene, 102(2): 229-236 (1991). 

[0019] As used herein, the terms "polynucleotide" and "oligonucleotide" are used 
interchangeably and mean single-stranded and double-stranded polymers of nucleotide 
monomers, including 2'-deoxyribonucleotides (DNA) and ribonucleotides (RNA) linked by 
internucleotide phosphodiester bond linkages, e.g. 3'-5' and 2*~5', inverted linkages, e.g. 3 '-3 5 and 
5 ? -5\ branched structures, or internucleotide analogs. Polynucleotides have associated counter 
ions, such as IT 1 , NH 4 + ? trialkylammonium, Mg 2+ , Na + and the like. A polynucleotide may be 
composed entirely of deoxyribonucleotides, entirely of ribonucleotides, or chimeric mixtures 
thereof. Polynucleotides may be comprised of internucleotide, nucleobase and sugar analogs. 
Polynucleotides typically range in size from a few monomeric units, e.g. 5-40 when they are more 
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commonly frequently referred to in the art as oligonucleotides, to several thousands of 
monomeric nucleotide units. Unless denoted otherwise, whenever a polynucleotide sequence is 
represented, it will be understood that the nucleotides are in 5' to 3 ' order from left to right and 
that "A" denotes deoxyadenosine, "C" denotes deoxycytosine, "G" denotes deoxyguanosine, and 
f T" denotes thymidine, unless otherwise noted. 

[0020] "Nucleotide" refers to a phosphate ester of a nucleoside, as a monomer unit 
or within a nucleic acid. "Nucleotide 5-triphosphate" refers to a nucleotide with a triphosphate 
ester group at the 5 1 position, and are sometimes denoted as "NTP", or "dNTP" and "ddNTP" to 
particularly point out the structural features of the ribose sugar. The triphosphate ester group 
may include sulfur substitutions for the various oxygens, e.g. a-thio-nucleotide 5 '-triphosphates. 
For a review of nucleic acid chemistry, see: Shabarova, Z. and Bogdanov, A. Advanced Organic 
Chemistry of Nucleic Acids, VCH, New York, 1994. 

[0021] "Nucleobase" or "base" when used herein refers to any nitrogen-containing 
heterocyclic moiety capable of forming Watson-Crick hydrogen bonds in pairing with a 
complementary nucleobase or nucleobase analog, e.g. a purine, a 7-deazapurine, or a pyrimidine. 
Typical nucleobases are the naturally occurring nucleobases adenine, guanine, cytosine, uracil, 
thymine, and analogs (Seela, U.S. Patent No. 5,446,139) of the naturally occurring nucleobases, e.g. 
7-deazaadenine, 7-deazaguanine, 7-deaza~8-azaguanine, 7-deaza-8-azaadenine, inosine, 
nebularine, nitropyrrole (Bergstrom, J. Amer. Chem. Soc. 117:1201-09 (1995)), nitroindole, 2- 
aminopurine, 2-amino-6-chloropurine, 2,6-diaminopurine, hypoxanthine, pseudouridine, 
pseudocytosine, pseudoisocytosine, 5-propynylcytosine, isocytosine, isoguanine (Seela, U.S. 
Patent No. 6,147,199), 7-deazaguanine (Seela, U.S. Patent No. 5,990,303), 2-azapurine (Seela, 
WO 01/16149), 2-thiopyrimidine, 6-thioguanine, 4-thiothymine, 4-thiouracil, O^-methylguanine, 
A^-methyladenine, O^-methylthymine, 5,6-dihydrothymine, 5,6-dihydrouracil, 4-methylindole, 
pyrazolo[3,4-D]pyrimidines, "PPG" (Meyer, U.S. Patent Nos. 6,143,877 and 6,127,121; Gall, 
WO 01/38584), ethenoadenine (Fasman (1989) in Practical Handbook of Biochemistry and 
Molecular Biology, pp. 385-394, CRC Press, Boca Raton, Fl), and locked nucleic acid (LNA). 

[0022] "Nucleoside" refers to a compound consisting of a nucleobase linked to the C- 
1' carbon of a sugar, such as ribose, arabinose, xylose, and pyranose, in the natural p or the a 
anomeric configuration. The sugar may be substituted or unsubstituted. Substituted ribose sugars 
include, but are not limited to, those riboses in which one or more of the carbon atoms, for 
example the 2'-carbon atom, is substituted with one or more of the same or different CI, F, -R, - 
OR, -NR 2 or halogen groups, where each R is independently H, Ci-C 6 alkyl or C 5 -Ci 4 aryl. 
Ribose examples include ribose, 2'-deoxyribose, 2',3-dideoxyribose, 2'-haloribose, 2'- 
fluororibose, 2'-chlororibose, and 2'-alkylribose, e.g. 2'-0-methyl, 4'-a-anomeric nucleotides, 1- 
co-anomeric nucleotides (Asseline (1991) Nucl. Acids Res. 19:4067-74), 2'-4'- and 3'-4'-linked and 
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other "locked" or "LNA", bicyclic sugar modifications (WO 98/22489; WO 98/39352; WO 
99/14226). "LNA" or "locked nucleic acid" is a DNA analogue that is conformationally locked 
such that the ribose ring is constrained by a methylene linkage between the 2 '-oxygen and the 3 ? - 
or 4 ? -carbon. The conformation restriction imposed by the linkage often increases binding 
affinity for complementary sequences and increases the thermal stability of such duplexes. 

[0023] Exemplary LNA sugar analogs within a polynucleotide include the 
structures: 




3 '-4' D-form LNA 3 '-4' L-form LNA 

1'R, 3'S, 4'R l'S, 3'R, 4'S 

[0024] where B is any nucleobase. 

[0025] Sugars include modifications at the 2'- or 3'-position such as methoxy, ethoxy, 
allyloxy, isopropoxy, butoxy, isobutoxy, methoxyethyl, alkoxy, phenoxy, azido, amino, alkylamino, 
fluoro, chloro and bromo. Nucleosides and nucleotides include the natural D configurational 
isomer (D-form), as well as the L configurational isomer (L-form) (Beigelman, U.S. Patent No. 
6,251,666; Chu, U.S. Patent No. 5,753,789; Shudo, EP0540742; Garbesi (1993) Nucl. Acids Res. 
21:4159-65; Fujimori (1990) J. Amer. Chem. Soc. 112:7435; Urata, (1993) Nucleic Acids 
Symposium Ser. No. 29:69-70). When the nucleobase is purine, e.g. A or G, the ribose sugar is 
attached to the N 9 -position of the nucleobase. When the nucleobase is pyrimidine, e.g. C, T or U, 
the pentose sugar is attached to the N J -position of the nucleobase (Kornberg and Baker, (1992) 
DNA Replication, 2 nd Ed., Freeman, San Francisco, CA). 

[0026] "Target", "target polynucleotide", and "target sequence" mean a specific 
polynucleotide sequence, the presence or absence of which is to be, detected, and that is the 
subject of hybridization with a complementary polynucleotide, e.g. a primer or probe. The target 
sequence can be composed of DNA, RNA, an analog thereof, and including combinations 
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thereof. The target can be single-stranded or double-stranded. In primer extension processes, the 
target polynucleotide which forms a hybridization duplex with the primer may also be referred to 
as a "template." A template serves as a pattern for the synthesis of another, complementary 
nucleic acid (Concise Dictionary of Biomedicine and Molecular Biology, (1996) CPL Scientific 
Publishing Services, CRC Press, Newbury, UK). A target sequence for use with the present 
invention may be derived from any living, or once living, organism, including but not limited to 
prokaryote, eukaryote, plant, animal, and virus. The target sequence may originate fi-om a 
nucleus of a cell, e.g., genomic DNA, or may be extranuclear nucleic acid, e.g., plasmid, 
mitochondrial nucleic acid, various RNAs, and the like. The target nucleic acid sequence may be 
first reverse-transcribed into cDNA if the target nucleic acid is RNA. A variety of methods are 
available for obtaining a target nucleic acid sequence for use with the compositions and methods 
of the present invention. When the target sequence is obtained through isolation from a 
biological sample, preferred isolation techniques include (1) organic extraction followed by 
ethanol precipitation, e.g., using a phenol/chloroform organic reagent (e.g., Ausubel et al., eds., 
(1993) Current Protocols in Molecular Biology Volume 1, Chapter 2, Section I, John Wiley & 
Sons, New York), or an automated DNA extractor (e.g., Model 341 DNA Extractor, Applied 
Biosystems, Foster City, CA); (2) stationary phase adsorption methods (e.g., Boom et al., U.S. 
Patent No. 5,234,809; Walsh et al., (1991) Biotechniques 10(4): 506-513); and (3) salt-induced 
DNA precipitation methods (e.g., Miller et al., (1988) Nucleic Acids Research, 16(3): 9-10). 

[0027] The term "primer-target duplex" when used herein refers to the product 
comprising a region of a primer that is annealed to a target sequence (Figure 7). 

[0028] The term "universal region 55 also referred to as a "5 5 universal region" or a 
"universal tag" as used herein refers to a region of an oligonucleotide primer that has no 
significant homology to any segment in the genome and is localized to the 5 5 end of the 
oligonucleotide primer. The "universal region" meets all the requirements for a normal 
oligonucleotide primer, such as lack of secondary structure, an appropriate Tm, and an 
appropriate GC content and may be between about 12 and 35 bases in length, between about 15 
and 25 bases in length or between about 18 and 22 bases in length. "Universal priming site" 
when used herein refers to a "universal region" of a primer that may function as a site to which 
universal primers anneal for priming of further cycles of DNA amplification. 

[0029] The term "universal primer 5 ' as used herein refers to a primer consisting of a 
"universal region". 

[0030] The term "universal-tagged oligonucleotide primer 55 or "universal-tagged 
primer" when used herein refers to a primer comprising a "5 '-universal region 5 ', as well as a "3' 
specific region" and a "random region," unless language to indicate the contrary is specified 
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(Figure 7). For example, "universal-tagged primer, having a specific binding region" herein 
refers to a primer of specified sequence, comprising a "universal tag. 5 ' 

[0031] The term "specific region" or "3 ? specific region" as used herein refers to a 
region of an oligonucleotide primer that is able to anneal to specified sites in the genome. Such a 
"specific region" is located on the 3 '-end of the oligonucleotide primer and is able to bind to a 
specific genomic sequence occurring in the human genome with a frequency between about 
0.01% and 2.0%, such as, for example, between about 0.05% and 0.1% or between about 0.1% 
and 0.5%. The length of the "specific region" of a primer depends mainly on the averaged 
lengths of the predicted PGR products based on bioinformatic calculations. The definition 
includes, without limitation, a "specific region" of between about 4 and 12 bases in length. In 
more specific embodiments, the length of the 3' specific region may, for example, be between 
about 4 and 8 bases, or between about 5 and 6 bases. Specific regions having a Tm of between 
about 50°C and 70°C are specifically included within the definition. 

[0032] The term, "specific primer," when used herein refers to a primer of specified 
sequence. 

[0033] The term "random region" as used herein refers to a region of an 
oligonucleotide primer that is able to anneal to unspecified sites in the genome. The "random 
region" facilitates binding of the primer to target DNA and binding of the polymerase enzyme 
used in PGR amplification to the duplex formed between the primer and target DNA. The 
random region nucleotides may be degenerate or non-specific, promiscuous nucleobases or 
nucleobase analogs. The length of the "random region" of the oligonucleotide primer, among 
other things, depends on the length of the specific region. In certain embodiments, without 
limitation, the "random region" is between about 2 and 15 bases in length, between about 4 and 
12 bases in length or between about 4 and 6 bases in length. In another embodiment, the specific 
and random regions combined will be about 9 bases in length, i.e. if the specific region consists 
of 4 bases, the random region will have 5 bases. 

[0034] The term "binding region," unless specified to the contrary, refers to a 
sequence comprising both a "specific region" and a "random region." A "specific binding 
region" when used herein refers to a sequence comprising a "specific region," but lacking a 
"random region." 

[0035] "Base pair" or "bp" when used herein refers to the association of two 
complementary nucleotides in a DNA or RNA molecule stabilized by hydrogen bonding. 

[0036] "GC content" as used herein refers to the percentage of guanine and cytosine 
bases in a particular oligomicleotide sequence. 
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[0037] "-mer" when used herein refers to a polynucleotide with the number of 
nucleotides specified immediately prior to "-mer." For example, "5-mer" refers to a 
polynucleotide consisting of 5 bases. 

[0038] "Primer" means an oligonucleotide sequence that is designed to hybridize 
with a complementary portion of a target sequence, a probe, or a ligation product, and undergo 
primer extension. A primer functions as the starting point for the polymerization of nucleotides 
(Concise Dictionary of Biomedicine and Molecular Biology, (1996) CPL Scientific Publishing 
Services, CRC Press, Newbury, UK). 

[0039] The term "degenerate primer" when used herein refers to a mixture of similar 
primers with differing bases at the varying positions (Mitsuhashi M, J Clin Lab Anal, 10(5):285- 
93 (1996); von Eggeling et al., Cell Mol Biol, 41(5):653-70 (1995); (Zhang et al., Proc. Natl 
Acad. Sci. USA, 89:5847-5851 (1992); Telenius et al., Genomics, \3(3)\l\%-25 (1992)). Such 
primers may include inosine as inosine is able to base pair with adenosine, cytosine, guanine or 
thymidine. Degenerate primers allow annealing to and amplification of a variety of target 
sequences that may be related. Degenerate primers that anneal to target DNA may function as a 
priming site for further amplification. 

[0040] The term "probe" when used herein refers to a polynucleotide that is capable 
of forming a duplex structure by complementary base pairing with a sequence of a target 
polynucleotide. For example, probes may be labeled, e.g. with a quencher moiety, or an energy 
transfer pair comprised of a fluorescent reporter and quencher. 

[0041] The term "set" when used herein refers to a multiplicity of primers. The 
minimum number of primers comprised within a set depends on the intended application, and 
may be determined based upon principles well known in the art, such as by using computer-based 
methods. An exemplary "set" may, for example, comprise between about 2 and 20 primers but 
the use of sets comprising more than 20 primers is also contemplated. 

[0042] The terms "duplex" means an intermolecular or intramolecular double- 
stranded portion of a nucleic acid which is base-paired through Watson-Crick, Hoogsteen, or 
other sequence-specific interactions of nucleobases. A duplex may consist of a primer and a 
template strand, or a probe and a target strand. A "hybrid" means a duplex, triplex, or other base- 
paired complex of nucleic acids interacting by base-specific interactions, e.g. hydrogen bonds. 

[0043] The term "anneal" as used herein refer to the base-pairing interaction of one 
polynucleotide with another polynucleotide that results in the formation of a duplex or other 
higher-ordered structure. The primary interaction is base specific, i.e., A/T and G/C, by 
Watson/Crick and Hoogsteen-type hydrogen bonding. 

[0044] The term "primer extension" means the process of elongating a primer that is 
annealed to a target in the 5' to 3 5 direction using a template-dependent polymerase. According 
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to certain embodiments, with appropriate buffers, salts, pH, temperature, and nucleotide 
triphosphates, including analogs and derivatives thereof, a template dependent polymerase 
incorporates nucleotides complementary to the template strand starting at the 3 '-end of an 
annealed primer, to generate a complementary strand. 

[0045] "Amplification" of a nucleic acid, as used herein, denotes the use of 
polymerase chain reaction (PCR) to increase the concentration of a particular DNA sequence 
within a mixture of DNA sequences. An "amplicon" is a target DNA sequence that is amplified 
by PCR. 

[0046] The term "real-time analysis" refers to periodic monitoring during PCR. 
Certain systems such as the ABI 7700 and 7900HT Sequence Detection Systems (Applied 
Biosystems, Foster City, CA) conduct monitoring during each thermal cycle at a pre-determined 
or user-defined point. Real-time analysis of PCR with FRET probes measures fluorescent dye 
signal changes from cycle-to-cycle, preferably minus any internal control signals. 

[0047] "5 '-nuclease analysis" or "5 '-nuclease assay" when used herein refers to "real- 
time analysis" for quantification of the amount of DNA amplified in a particular PCR reaction. 
TaqMan® analysis is an example of such "5 '-nuclease analysis." "5 '-nuclease analysis" involves 
the use of a fluorogenic oligonucleotide probe to which a reporter dye and a quencher dye are 
attached. During amplification of a nucleotide sequence using a forward and reverse primer, the 
probe anneals to the target of interest between the forward and reverse primer sites. During 
extension, the probe is cleaved by the 5 '-nuclease activity of the DNA polymerase. As the 
cleavage separates the reporter dye from the quencher dye, the reporter dye's fluorescence 
increases which may be detected and quantitated. Real-time analysis of PCR with 5 '-nuclease 
assay involves FRET probes that may be displayed by plotting the logarithmic change in detected 
fluorescence (ARn) versus the cycle number. The cycle within the PCR protocol at which the 
change in fluorescence (ARn) rises above a threshold value is denoted as C T . The C T cycle is 
approximately the cycle at which amplification of target becomes exponential. A relatively low 
C T value indicates efficient detection of amplicon. The threshold cycle is highly correlated to the 
amount of copy number, or amount of target polynucleotide present in the sample, as well as the 
efficiency of amplification. The effects of primer constitution, e.g. length, sequence, mismatches, 
analogs, can be conveniently screened and quantitated by measurement of C T values during real- 
time analysis of PCR. 

[0048] "FRET" when used herein refers to fluorescence resonance energy transfer, a 
quantum mechanical phenomenon that occurs between a fluorescence acceptor and a 
fluorescence donor. For example, the TaqMan® assay is a method that utilizes oligonucleotide 
probes that are labeled at each end with a fluorescence donor and acceptor pair and undergo 
FRET in their intact state. The probe is designed to hybridize to a sequence being amplified in a 
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PCR reaction using a polymerase. If a duplex forms between the probe and the target, the 5 f ~ 
exonuclease activity of the Taq® polymerase is able to digest the hybridized TaqMan® probe 
during the elongation cycle, separating the donor from the acceptor and resulting in a decrease in 
FRET and an increase in emission of fluorescence from the fluorescence donor. 

[0049] "Polymerase chain reaction" or "PCR" as used herein, refers to a method 
well-known in the art for amplification of a nucleic acid. The method involves introducing a 
molar excess of two or more extendable oligonucleotide primers to a reaction mixture comprising 
the desired target sequence(s), where the primers hybridize to opposite strands of the double 
stranded target sequence. The reaction mixture is subjected to a precise program of thermal 
cycling in the presence of a DNA polymerase, resulting in the amplification of the desired target 
sequence flanked by the oligonucleotide primers. The oligonucleotide primers prime multiple 
sequential rounds of DNA synthesis, each round of synthesis is typically separated by a melting 
and re-annealing step. Methods for a wide variety of PCR applications are widely known in the 
art, and are described in many sources, for example, Ausubel et al (eds.), Current Protocols in 
Molecular Biology, Section 15, John Wiley & Sons, Inc., New York (1994). 

[0050] "In silico e-PCR" when used herein refers to a computer-conducted method 
for predicting the size and probability of amplification of a nucleotide sequence using a particular 
set of primers. The method involves searching a DNA database for exact matches to the primer 
sequences and further for sequences having the correct order, orientation, and spacing to allow 
priming of amplification of a nucleotide sequence of a predicted size. 

[0051] "Single Nucleotide Polymorphism" or "SNP" when used herein refers to a 
variation in a single nucleotide in a genomic sequence. In one embodiment, such "SNP"s may be 
functionally significant and may play a role in susceptibility to disease. 

[0052] "SNP genotyping" when used herein refers to detecting whether a particular 
SNP is present in a DNA sample isolated from a particular cell or individual. 

[0053] "Tm" as used herein, refers to the melting temperature (temperature at which 
50% of the oligonucleotide is a duplex) of the oligonucleotide calculated using the nearest- 
neighbor thermodynamic values of Breslauer et al (Proc. Natl. Acad. Sci USA 83:3746-3750, 
1986) for DNA and Freier et al (Proa Natl. Acad Sci USA 83:9373-9377, 1986) for RNA. 

[0054] "Stoffel fragment" when used herein refers to an enzyme that is a modified 
form of recombinant AmpliTaq® DNA Polymerase. This enzyme is more thermostable than 
AmpliTaq DNA Polymerase, is active over a 2mM-10mM range of magnesium iron 
concentration, and lacks 5 '-3 5 exonuclease activity. 

[0055] "Genomic DNA" or "gDNA" refers to all the DNA sequences comprising the 
entire genome of an organism. 
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[0056] "Complementary DNA" or "cDNA" when used herein refers to a DNA copy 
of a mRNA. Such "cDNA" lack nitrons present in genomic DNA, 

[0057] "Messenger RNA" or "mRNA" when used herein refers to RNA that 
specifies the order of amino acids in protein synthesis. 

[0058] The term "end-point analysis" refers to a method where data collection 

occurs only when a PGR reaction is substantially complete. 

[0059] The term "significant homology" when used herein means perfect 

complementarity between bases or alternatively, no more than 1 or 2 mismatches of bases. 

B. Universal-tagged Primer Design 

[0060] Based on knowledge of the sequences of entire genomes, it is possible to 
compute the minimal number of primers needed to amplify all the genes within a particular 
genome. For example, such genome-directed primer strategies predicted that 37 primers (7-mers 
or 8-mers) were able to prime the 3,924 genes in the Mycobacterium tuberculosis genome (Talaat 
et al., Nature Biotech, 18:679-682 (2000)). In addition, the completion of the approximately 3 
billion base pairs sequence of the human genome enables the design of primers based on whole- 
genome scale instead of a random approach. 

[0061] The present invention provides a sequence-directed strategy to amplify 
fractions of the entire genome, including the whole human genome. This strategy involves the 
design of primers (Figure 7) comprising the following regions: (1) a specific region (typically 
about 5 bases long), (2) a random region (typically about 4 bases long), and (3) a universal 
region (typically about 18 bases long). The specific region, which is located on the 3 '-end of the 
primer, can anneal to many specified sites in the genome. The random region helps to facilitate 
binding of the primer to target DNA and binding of the polymerase enzyme used in PCR 
amplification to the priming site. The universal region is a priming site for further PCR 
amplification with a complementary primer. The role of each region in the PCR amplification 
process will be described in greater detail below. 

[0062] In general PCR primers are designed using a core set of parameters, such as 
primer length, melting temperature (T m ) 5 GC content, complementarity to optimize the PCR 
products, primer bending and folding, weighting these and similar parameters to different degrees 
depending on the specific task. 

[0063] Since both specificity, and the temperature and time of annealing are at least 
partly dependent on primer length, the total length of the primers is critical for the successful 
performance of the PCR amplification reaction. In general, longer primers are typically less 
efficient than shorter primers. Accordingly, this is taken into account when determining the 
length of the individual regions within the primers herein. In general, the total length of the 
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primers may be between about 18 bases and about 65 bases, such as, for example, between about 
23 and 45 bases or between about 27 and 34 bases. 

[0064] In the primers of the present invention, the specific region is designed to 
specifically bind short sequences occurring with high frequency in the genome to be amplified. 
The length of the specific region depends mainly on the averaged lengths of the predicted PCR 
products based on bioinformatic calculations. In general, the longer the specific region is, the 
more primers are needed to amplify the same number of regions of the genome, because longer 
specific regions are more rare and typically are spaced farther apart. Thus, based upon stability 
and efficiency of duplex formation, the use of a sequence-specific region of about 9 bases would 
appear ideal. However, unique sequences of 9 bases in length are rare in the genome, therefore, 
the amplification of the same number of regions of the genome, using primers with specific 
regions of 9 bases would require more primers than the same amplification using primers with 
shorter specific regions. In order to determine the frequency of 5-mers in the whole human 
genome, the occurrence of all 1024 possible 5-mers were counted. The results are shown in 
Figure 1. For the top 50 most frequent 5-mers (Table 1), the average distance between a given 5- 
mer is about 200-500 bp. In addition, when one of the top 20 most frequent 5-mers and 6-mers in 
the human genome are used for amplification of the human genome, bioinformatic calculations 
show that the amplified products average 74-523 bps. Thus, primers with specific regions less 
than 9 bases are contemplated. Based on the foregoing considerations, the primers of the present 
invention specifically include specific regions of between about 4 and 12 bases in length, 
although the use of primers with longer or shorter specific regions, for example between about 4 
and 8 bases in length or between about 5 and 6 bases in length is also contemplated. When the 
specific region is 5 bases in length, the sequence of the 3 5 specific region may be selected from 
the group consisting of the primers listed in Table 1 . 

[0065] The random region is used to adjust the performance of the primer, both in 
terms of thermodynamics of the primer binding to the target (e.g., T m of primer), and an expected 
stabilizing effect of having a longer duplex for the polymerase enzyme to bind. The length of the 
random region of the primer depends on the length of the specific primer and the fact that the 
minimum duplex required for good enzyme activity is expected, as mentioned above, to be about 
9 bases, but also depends on the presence of high T m nucleobase analogs. The random region 
may be between about 2 and 15 bases in length, between about 4 and 12 bases in length or 
between about 4 and 6 bases in length. A duplex of such length also provides the primer with a 
favorable thermal melting temperature (T m ). Thermal melting, T m , values were estimated for 
DNA primers and DNA probes by calculations using the basic formula: T m = 81.5 + 
16.6(logio[Na + ]) + 0.41(%G+C) - (600/N), where N = oligonucleotide length in number of 
nucleotides (Bolton et al., Proc. Natl Acad. Set, 48:1390 (1962); Sambrook et al., Molecular 
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Cloning, A Laboratory Manual, Second Edition, pgs. 2:11.46, 9.50-9.51, (1989). Refinements to 
the basic formula may be made for nearest-neighbor and solvent effects (Blackburn and Gait 
(editors), "DNA and RNA structure," Nucleic Acids in Chemistry and Biology, 2 }ld edition, 
Oxford University Press, pgs. 70-71 (1996); Breslauer et al., Proc. Natl. Acad. Set USA, 
83:3746-50 (1986)). In calculations done using a current nearest-neighbor model (Figure 6), this 
length of primer will have an average T m of about 20 °C for a 9-mer (with a universal region 
under typical buffer conditions). However, even with such a low T m , the predicted fraction of the 
duplex formed at higher temperatures is not zero. In fact, at 50 °C, where the polymerase enzyme 
has high activity, there is still a small fractional amount of hybridized primer (about 0.1 to 1% at 
high, e.g. mM, primer concentrations). Accordingly, in primers containing a 5 base specific 
region, the random region may be about 4 bases, although longer or shorter random regions are 
contemplated. When the random region is 4 bases in length, the sequence is selected from all 
possible combinations of A, C, T and G bases. Preferably, such random regions do not include 
combinations occurring in the genome with high frequency. 

[0066] The universal region of the primer is designed to have no significant 
homology to any segment in the genome and serves to function as sites to which universal 
primers anneal for future cycles of amplification. The universal region must meet ail the 
requirements for a normal PCR primer, such as a lack of secondary structure, an appropriate Tm 
(typically between about 55 and about 65 °C) and appropriate GC content (typically about 40- 
60%). The sequence of the 5 ? universal region may, for example, be selected from the group 
consisting of: AGTGGTGCCAGCTCAGCAG (SEQ ID NO: 1 ) 

CATGGAGTGTTGGCCAGGA (SEQ ID NO: 2), AACTCTCTCCCAAGAGCGA (SEQ ID NO: 
3); TAGCGTAGTGAGCATCCGT (SEQ ID NO: 4); TCCCACATTCACCGCTTAG (SEQ ID 
NO: 5); TGTAAAACGACGGCCAGT (SEQ ID NO: 6); and GTACGCAGTCTACGAGGC 
(SEQ ID NO: 7). The length of the universal region will be affected by the lengths of the specific 
and random regions, and the total length of the primer. The length may be between about 12 and 
35 bases in length, between about 15 and 25 bases in length or between about 18 and 22 bases in 
length. Typically, the whole primer, including the universal region (providing a universal 
priming site) may be between about 1 8 bases and about 65 bases, between about 23 and about 45 
bases or between about 27 and about 34 bases. 

[0067] The primers of the invention may include locked nucleic acid (LNA) bases. 
LNA are modified DNA analogs, which obey Watson-Crick base pairing and Hoogsten triplex 
formation rules, and have improved hybridization characteristics and biostability (see, e.g., Singh 
et al, Chem. Commun. 455-456 (1998); Koshkin et al, Tetrahedron 54:3607-3630 (1998)). 
Under appropriate conditions, and with careful primer design, LNA bases can increase Tm, and 
improve the efficiency of nucleic acid amplification. 
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Table 1 : 50 Most Frequent 5-mers in the Human Genome 



PCT/US03/14491 



5-mer 


C-5-mer 


Counts 


Frequency (%) 


AAAAA 


TTTTT 


34196410 


1.34318 


AAAAT 


ATTTT 


20414954 


0.80187 


AAATA 


TATTT 


16864439 


0.66241 


TAAAA 


TTTTA 


16839317 


0.66142 


AGAAA 


TTTCT 


16045399 


0.63024 




TTTAT 


14223540 


0.55868 


AAATT 


AATTT 


14108156 


0.55415 


TTAAA 


TTTAA 


13753076 


0.5402 


GAAAA 


TTTTC 


13720656 


0.53893 


AATAA 


TTATT 


13638528 


0.5357 


CAAAA 


TTTTG 


13576003 


0.53324 


AAGAA 


TTCTT 


13289245 


0.52198 


AAAGA 


TCTTT 


13283223 


0.52174 


AAAAG 


CTTTT 


12972403 


0.50954 


AAATG 


CATTT 


12214057 


0.47975 


AAACA 


TGTTT 


12038523 


0.47285 


ACAAA 


TTTGT 


12004251 


0.47151 


ATTTA 


TAAAT 


11635302 


0.45702 


TGAAA 


TTTCA 


11510593 


0.45212 


ATATA 


TATAT 


11393381 


0.44751 


AATAT 


ATATT 


11353367 


0.44594 


AAAAC 


GTTTT 


10747752 


0.42216 


AGAGA 


TCTCT 


10693461 


0.42002 


AATTA 


TAATT 


10254452 


0.40278 


AGGAA 


TTCCT 


9935310 


0.39024 


AACAA 


TTGTT 


9923966 


0.3898 


CCCAG 


CTGGG 


9871598 


0.38774 


TCAAA 


TTTGA 


9698717 


0.38095 


ATTTC 


GAAAT 


9622174 


0.37794 


ATAAT 


ATTAT 


9603349 


0.3772 


ACACA 


TGTGT 


9546388 


0.37497 


CAGAA 


TTCTG 


9465179 


0.37178 


AGAAT 


ATTCT 


9299112 


0.36526 


GAGAA 


TTCTC 


9286534 


0.36476 


GGAAA 


TTTCC 


9272767 


0.36422 


TCCCA 


TGGGA 


9183471 


0.36071 


ATTAA 


TTAAT 


9159692 


0.35978 


ATGAA 


TTCAT 


9130836 


0.35865 


AAAGT 


ACTTT 


9098500 


0.35738 


AATGA 


TCATT 


9095115 


0.35724 


TTCAA 


TTGAA 


9016495 


0.35415 


ATTTG 


CAAAT 


8987366 


0.35301 


TCTCA 


TGAGA 


8981902 


0.3528 


ACAGA 


TCTGT 


8979579 


0.3527 



14 



WO 03/097794 PCT/US03/14491 
C. Use of universal-tagged primers to amplify fractions of the genome 
[0068] Practical issues hinder the amplification of all 3 billion base pairs of a whole 
genome in a single PCR reaction. Such limitations include the availability of primers and dNTPs, 
and the total amount of DNA that can be amplified in a single tube. For example, with 2 mM 
dNTPs in a 100 jlxI volume, one PCR reaction can make about 1.6 X 10 8 copies of the genome 
which is about 2.7 pM of the genomic DNA (gDNA). To form 1.5 X 10 7 fragments with the 
average size of 200 base pairs, the reaction needs 2.4 X 10 15 pairs of primers to make 1.6 X 10 8 
copies for each using approximately a primer concentration of 40 pM in each 100 jxL PCR 
reaction. 

[0069] Because of the practicality limitations, a more feasible approach is to amplify 
partial regions of the whole genome using a set of primers designed as discussed above, for 
example a set of primers including a sequence-specific region, a random region, and a universal 
adapter region (universal tag) (Figure 7). Such primers will be briefly referred to as universally 
tagged primers. In silico "e-PCR" (Schuler, G.D., Genome Res, 7(5):541-50 (1997); Schuler, 
GD, Trends Biotechnol, 16(ll):456-9 (1998)) of gDNA may be used to predict the product, 
expected size of the product, and probability of amplifying a particular region of the genome with 
a set of such primers in a particular PCR reaction. Accordingly, primer sets can be pooled based 
on such computer-based in silico "e-PCR" of gDNA. A possible format is to first amplify the 
whole genome in separate PCR reactions in a 96-well plate, with each well containing selected 
primer sets designed as discussed above, such as universally tagged primer sets. The amplified 
products can then be further amplified with the two universal primers that are complementary to 
the universal priming sites, i.e. Ul or U2, of the universal-tagged primers (Figure 3), e.g. 9-mers. 

1. Efficiency of Amplification 

[0070] In one embodiment of the invention, the efficiency of amplification of 
sequences from gDNA by PCR using universal-tagged primers may be determined. Universal- 
tagged primers may be incubated in a PCR reaction mixture containing gDNA, dNTPs, and DNA 
polymerase and subjected to specific thermal cycling conditions. The PCR products may further 
be subjected to 5'-nuclease assay. The efficiency of the amplification using universal-tagged 
primers may be determined by comparing the results from the 5 '-nuclease assay on the PCR 
products with results of 5 '-nuclease assay performed on a known concentration of gDNA not 
subjected to PCR amplification. Further efficiency may be determined by subjecting the PCR 
products generated from reactions containing universal-tagged primers to gel electrophoresis 
analysis . The presence of smearing in the gel electrophoresis analysis suggests "random 
priming' 5 of the universal-tagged primers on the genomic DNA. 
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2. Further Amplification 
[0071] In another embodiment, the present invention provides a method for further 
amplification of PGR products initially generated by the universal-tagged oligonucleotide 
primers. Such further amplification involves the use of the universal tag in the universal-tagged 
primer as a site that allows annealing of universal primers for the priming of further amplification 
cycles. 

[0072] The method for further amplification involves 5 '-nuclease assay analysis 
which includes a second PGR reaction, containing the PGR products resulting from the initial 
reaction and universal primers having homology to the universal region of the universal-tagged 
primers used in the initial PCR reaction. The yield of DNA amplified in this second round of 

> 

amplification may be quantitated from the 5 5 -nuclease assay. 
D. Applications of the Invention 

[0073] Embodiments of the present invention include application of the invention to 
processes that rely on significant amplification of DNA. 

[0074] One such process that relies on amplification of DNA is SNP genotyping 
which involves mapping and associating genetic variations with particular diseases and is 
especially beneficial to the study of the genetic basis of complex disease traits. SNP genotyping 
is of significant value to the treatment and diagnosis of particular diseases and further for 
pharmacogenetics which focus on identifying the genetic elements that underlie variations in 
patients with respect to drug responsiveness or toxicity. 

[0075] Although a number of varying methods for SNP genotyping exist, most 
methods often require a quantity of gDNA from a particular individual or group of individuals 
that is sufficient for the genotyping. The isolated DNA sample for such genotyping is often 
subjected to an initial and critical DNA amplification step. Accordingly, one embodiment of the 
invention involves applying the universal-tagged primer method of DNA amplification to SNP 
genotyping. For example, DNA to be analyzed by SNP genotyping may be amplified using the 
universal-tagged primer method of the present invention and subsequently subjecting the 
amplified to SNP genotyping analysis. Other embodiments include applying the present 
invention to other multiplex genetic applications that involve DNA amplification. 

EXAMPLES 

[0076] The following examples are offered by way of illustration and not by way of 
limitation. The examples are provided so as to provide those of ordinary skill in the art with a 
complete disclosure and description of how to make and use the compounds, compositions, and 
methods of the invention and are not intended to limit the scope of what the inventors regard as 
their invention. Efforts have been made to insure accuracy with respect to numbers used (e.g. 
amounts, temperature, etc.) but some experimental errors and deviation should be accounted for. 
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The disclosures of all citations in the specification are expressly incorporated herein by 
reference. 

EXAMPLE 1 
Universal-Tagged Primers 

[0077] The substitution of nucleic acids in a primer with nucleic acid analogues, 
having backbone modifications (Amberg et al., Nucleosides Nucleotides Nucleic Acids, 20(4- 
7): 1275-8 (2001); Obika et al., Bioorg Med Chem, 9(4):1001-11 (2001); Jones et al. 5 Nucleic 
Acids Res, 24(21):41 17-22 (1996)) may affect the T m of the nucleic acid sequence. For example, 
locked nucleic acid (LNA) may induce an increase in the thermal stability (melting temperature 
(Tm)) of duplexes toward complementary DNA (Wahlestedt et al., Proc. Natl Acad. Set USA, 
97(10) 5633-5638 (2000); Petersen et al., JMol Recognita 13(l):44-53 (2000)). 

[0078] The present example demonstrates the effects of the substitution of 
oligonucleotide primers with conformationally modified nucleic acid analogues. 

A. Effect of LNA substitution 

[0079] To examine the effects of LNA substitution on DNA amplification 
efficiency, primers of 9 or 10 bases in length that have LNA base substitutions (Table 2) and 
designed to have homology with the synthetic target were incubated with a synthetic target (Table 
2), containing sites for the LNA substituted primers, the 5 '-nuclease probe, 5 '-nuclease forward 
and reverse primers, dNTPs, and DNA polymerase and subjected to specific thermal cycling 
conditions. The yield of the PGR products resulting from the reaction were quantitated by 5'- 
nuclease analysis. 

[0080] The template contains sites (5'-NNNNaaaaaa (SEQ ID NO: 8) and 5'- 
ttttttNNNN (SEQ ID NO: 9) for binding of the LNA-substituted primers. For further 5 '-nuclease 
analysis of the PCR products generated, the template contains sites for binding of the 5 '-nuclease 
probe, forward and reverse primers (Table 2). In the sequences in Table 2, a base in bold and 
upper case (e.g. T) represents a LNA base, the lower case t is the normal base, and n is a 
degenerate base (A or C or G or T). 
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Table 2: Sequences of synthetic template and LNA substituted primers 



Primer Name 


Sequence (5 '-3') 


Template 


tttttnnnncctagcgtagtgagcatccgtaagcattcatcgtgaggg 
gtctttgtcctcctatggtagcagtcacgaggcatnnnnaaaaaa 
(SEQIDNO: 10) 


5 '-nuclease Forward Primer 


cctagcgtagtgagcatccgt (SEQ ID NO: 11) 


5 '-nuclease Reverse Primer 


atgcctcgtgactgctacca (SEQ ID NO: 12) 


5 '-nuclease Probe 


FAM-aagcattcatcgtgaggggtctttgtcctccta (SEQ ID 
NO: 13)-TMR 


LNA9T5n4 


TTTTTnnnn (SEQ ID NO: 14) 


LNA9T6n3 


TTTTTnnn 


LNA10T6n4 


TTTTTTnnnn (SEQ ID NO: 15) 



[0081] The PCR mixture includes 5 |LiL of 100 fM of the synthetic target, 5 pL of 
either the 9-mer and 10 mer, having a binding region, (10 |oM), 5 juL of 10X Buffer, 5 yL of 25 
mM MgCl 2 , 5 jxL of 2.5 raM dNTPs, 1 jjL of Taq-Gold polymerase (5U/|jL), and 24 p,L of 
distilled deionized H 2 0 (ddH 2 0). The conditions for the PCR cycles were an initial denaturation 
at 95°C for 10 minutes, followed by 30 to 40 cycles of the following: 95°C for 30 seconds, either 
55°C or 45 °C for 3 minutes and 72°C for 1 minute. 

[0082] Following the PCR procedure, the PCR products were diluted by 1 00-fold 
prior to 5 '-nuclease analysis. The yields were quantified by 5 '-nuclease real-time analysis, and 
the results were summarized in Table 3 . 



Table 3. 5 '-nuclease analysis results of PCR amplification using LNA substituted primers 



Primer Used 


PCR Yield @ 55°C 


PCR Yield @ 45°C 


LNA9T5n4 


0.0026 nM 


0.0034 nM 


LNA10T6n4 


0.23 nM 


0.2nM 


LNA9T6n3 


0.012 nM 


0.01 nM 


Positive Control 


3.3 nM 


3.3 nM 


No primer 


0.002 nM 


0.0059 nM 


No template 


0 


0 



[0083] Before amplification, the initial concentration of the template was 10 fM. 
The annealing temperature at 55°C and 45°C resulted in similar yields in all cases. Here, a 10- 
mer primer, having a binding region, substituted with 6 LNA bases is able to amplify the 
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synthetic target about 2.3 X 10 4 -fold after 30 cycles of PCR. 9-mer primers, having a binding 
region, substituted with LNA bases resulted in little amplification under the PCR conditions. 
B . Effect of univer s al-tagging of primers 

[0084] The substitution of LNA bases are further examined for their effect on the 
efficiency of amplification using short, oligonucleotide primers fused to a universal tag. LNA 
substituted primers may be expected to increase the Tm of the primer. 

[0085] To determine whether LNA substitution of bases in universal-tagged specific 
primers, lacking random regions, has an effect on the efficiency of PCR amplification, real-time 
analysis was performed on 5 '-nuclease assay PCR reactions containing a synthetic template 
(Table 4), having sites for the universal-tagged specific primers, the 5 '-nuclease forward and 
reverse primer, 5 5 -nuclease probe and universal-tagged specific primers, specifically designed to 
have homology with the synthetic template and to contain base substitution with either 0, 1, 2, 3 
or 5 LNA bases was performed. The efficiency of amplification was analyzed by subjecting the 
PCR products to gel electrophoresis analysis and 5 '-nuclease analysis. Further, quantitative 
analysis of the copy efficiency was obtained by the 5 '-nuclease analysis. 

[0086] The schematic for such an experiment is depicted in Figure 4. During the 
thermal cycles of the PCR reactions described below, the universal-tagged specific primer 
anneals to the primer site and extends to make a copy of the template, Subsequently, the forward 
primer extends along the copy of the template that has LNA bases and a universal tag. If the Taq 
DNA polymerase can copy the LNA bases efficiently, the PCR product will be extended through 
the LNA bases and form a longer template incorporating the universal primer site. 

[0087] For such PCR reactions, the template (Table 4) contains a site (5 5 - 
aaaaaaaaaaaaaa) (SEQ ID NO: 16) for binding of the universal-tagged specific primer. Further, 
the template contains sites for binding of 5 5 -nuclease forward (5'-cctagcgtagtgagcatccgt) (SEQ 
ID NO: 17 ) and reverse (5'-tggtagcagtcacgaggcat) (SEQ ID NO: 18) primers, and a 5 '-nuclease 
probe (5'-gaagtggcaacagagaaggaa) (SEQ ID NO: 19). The uppercase and bold letters (e.g. T) in 
column 2 of Table 4 represents a LNA base and the lower case t is the normal base. 
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Table 4: 5 5 -nuclease Ct values of LNA-substituted primers 



1 

Primer 
Name 


Sequence (5'-3 ? ) 


Notes 


Ct 

Value 






Positive Control 


15.7 


9t-U 


atgecleglagaclgGglacglUUUU (SEQ 
ID NO: 20) 


Univei*sal-tagged 9 base specific 
primer, 0-LNA 


18.6 


lOt-U 


algcclcglagaoLgcglacgUlULLUt (SEQ 
ID NO: 21) 


Universal-tagged 10 base specific 
primer, 0-LNA 


17.5 


lOt-l-LNA 


atgcctcgtagactgcgtacgttttTttttt (SEQ 
ID NO: 22) 


Universal-tagged 10 base specific 
primer, 1-LNA 


17.0 


10t-2-LNA 


atgcctcgtagactgcgtacgttTtTttttt 
(SEQ ID NO: 23) 


Universal-tagged 10 base specific 
primer, 2-LNA 


17.7 


10t-3-LNA 


atgcctcgtagactgcgtacgttTtTtTttt 
(SEQ ID NO: 24) 


Universal-tagged 10 base specific 
primer, 3-LNA 


25.4 


10t-5-LNA 


atgcctcgtagactgcgtacgtTTTTTtttt 
(SEQ ID NO: 25) 


Universal-tagged 10 base specific 
primer, 5-LNA 


40 


No template 




Negative Control 


40 


Template 


cctagcgtagtgagcatccgtaGAAGTGG 
CAACAGAGAAGGAAtggtagcagt 
cacgaggcataaaaaaaaaaaaaa (SEQ ID 
NO :26) 







[0088] For the PCR reactions, the initial concentration of the template was 1 00 fM 
and the concentrations of the primers from Table III were 1 mM in a 25 mL reaction volume. 
Specifically, the 5 5 -nuclease PCR mixture contains 2.5 pL synthetic template (lpM), 1.25 juL 5'- 
nuclease probe 1 13 (5 pM), 2.5 |LiL forward 5 5 -nuclease primer 115 (1-10 pM), 3.75 pL ddH 2 0, 
12.5 pL 2X Master mixture for 5 '-nuclease, and 2.5 pL primers selected from Table m (1.6 nM- 
10 pM) from Table HI. The 5 5 -nuclease PCR mixture is subjected to an initial incubation for 2 
minutes at 50°C, an incubation for 10 minutes at 95°C, and 50 cycles of the following: 15 
seconds at 95°C, 1 minute at 50°C, and 1 minute at 60°C. 

[0089] Upon completion of the thermal cycles, an aliquot of the PCR products was 
subjected to gel electrophoresis analysis. 

[0090] The gel electrophoresis analysis of the PCR products showed that the control 
experiment with normal primers formed the 60 bp products. However, the products with forward 
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primer and the universal tagged 9-mer or 10-mer ? having a specific binding region, (referred to in 
Table HI as 9t and lOt, respectively), were 20 bp longer. The increase in length indicated that 
these PCR products included a 20 base universal tag. The products of primers, 9t-U, 10t-U,10t-l- 
LNA, and 10t-2-LNA had clear bands on the gel while the products of primers, 10t-3-LNA and 
10t-5-LNA gave no detectable bands. 

[0091] The 5 '-nuclease results are summarized in Table 4. Cycle threshold (Ct) 
values indicate that the higher Tm provided by substitution with LNA bases did not correlate 
with greater efficiency in PCR amplification. For example, the primer 10t-5-LNA that has 5 
LNA bases did not amplify the template. It may be that the enzymes cannot copy 5 LNA bases 
effectively. The primers with no LNA base substitutions, 9t-U and lOt-U, had Ct values of 18.6 
and 17, respectively, and were able to significantly amplify the template with efficiency similar 
to amplification with normal specific primers. The primers with LNA substitutions amplified the 
template with similar efficiencies or worse efficiencies than efficiencies using primers without 
LNA substitutions. The substitution with LNA bases which increases the Tm did not 
significantly affect the efficiency of PCR amplification. For example, 10t-3-LNA (Ct+25.41) 
was significantly less efficient, although the Tm of 10t-3-LNA primer is higher than that of 10t-l- 
LNA, 10t-2-LNA, and lOt-U. C T represents the cycle at which amplification becomes 
exponential. 

EXAMPLE 2 

Method of Using the Universal-tagged Oligonucleotide Primers 
A. Amplification of gDNA 

[0092] To determine whether universal-tagged primers can amplify gDNA, 
amplification of human gDNA was performed using exemplary universal-tagged primers (Figure 
7), containing a 17 base universal region, a 4 base random region, and a 5 base specific region. 
In this particular instance, the specific region was not selected from the top 20 most frequently 
occurring 5-mers in the human genome (Figure 1). The COX6b gene was selected as a test target 
for the gDNA amplification, based on bioinformatic e-PCR predictions. The efficiency of the 
amplification using a set of universal-tagged primers was compared to the efficiency calculated 
from amplification using different sets of primers, selected from Table IV. The schematic 
sequence structure of the human COX6b gene (SEQ ID NO: 35) is represented in Figure 5 with 
the location of each of the primer sites clearly designated. 

[0093] The oligonucleotide primers used for amplification of gDNA are described in 
Table 5. The universal tagged sequences were 5 '-CATGGAGTGTTGGCCAGGA (U2) (SEQ ID 
NO: 2) and 5 '-AGTGGTGCCAGCTCAGCAG (Ul) (SEQ ID NO: 1). N represents one of the 
normal bases, A, C, G, or T. 
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Table 5: Sequences of primers used in primer sets for amplification of gDNA 



Primer Name 


Sequence (5'-3') 


COX6b-Fa-U2 


CATGGAGTGTTGGCCAGGAGGTCACAGA (SEQ ID NO: 27) 


COX6b-Fb-U2 


CATGGAGTGTTGGCCAGGAGGGATGAGC (SEQ ID NO: 28) 


COX6B-Ra-Ul 


AGTGGTGCCAGCTCAGCAGCCTGGGAGA (SEQ ID NO: 29) 


U2-GGCTG 


CATGGAGTGTTGGCCAGGANNNNGGCTG (SEQ ID NO: 30) 


Ul-GGAGA 


AGTGGTGCCAGCTCAGCAGNNNNGGAGA (SEQ ID NO: 31) 


COX6-F58 


AGCAACGGGCTGAAGGC (SEQ ID NO: 32) 


COX6-R58 


TGGAGGACAGAGGAAAGGGA (SEQ ID NO: 33) 


COX-Probe 


FAM-AGATGCAGCCAGTTCAGATCTTCCCG (SEQ ID NO: 34)-TMR 



[0094] The PCR reactions contained the following: 2 mL gDNA CEPH 1347-02 (50 
ng/pL) (100 ng genomic DNA), according to the combinations outlined in Table V, 2 pL of each 
of the two primers (10 pM) selected from Table IV, 2 pL 10X Buffer, 2 pL MgCL 2 (25 mM), 2 
pL dNTPs (2.5 mM), 1 pL Taq-Gold polymerase (5U/pL), 7 p.L ddH 2 0. The reactions were 
subjected to thermal cycling according to the following: an initial incubation for 10 minutes at 
95°C, 30 cycles of 30 seconds at 95°C, 1 minute at 50°C, 1 minute at 72°C, and a final incubation 
for 7 minutes at 72°C, before incubation at 4°C. 

[0095] The PCR products were subjected to gel electrophoresis analysis and 5'- 
nuclease analysis. For gel electrophoresis analysis, an aliquot of the PCR products were 
separated on an agarose gel by gel electrophoresis. Gel electrophoresis analysis of the PCR 
products from reactions containing the normal F58 and R56 specific primers showed the 
production of one correct main product and some longer fragments. Gel electrophoresis analysis 
of the PCR products from reactions containing the universal-tagged 9-mers, having a binding 
region, showed the production of a number of smeared bands that resulted from the "random 
priming" of the universal-tagged 9-mers, having a binding region, on the genomic DNA. 

[0096] For 5 '-nuclease analysis, the PCR products from the reactions containing 
various primer combinations were diluted 1000-fold. The reactions subjected to 5 '-nuclease 
analysis were as follows: 1.25 pL 5 '-nuclease or FRET probe (5 pM), 2.5 pL F58 primer (1 
pM), 2.5 pL R58 primer (1 pM), 3.75 pL ddH 2 0, 12.5 pL 2X Master mixture for 5 ? -nuclease, 
and 2.5 pL of the 1000-fold diluted PCR product. The thermal conditions for 5'-nuclease 
analysis were 2 minutes at 50°C, 10 minutes at 95°C, and 40 cycles of 15 seconds at 95°C and 1 
minute at 60°C. 

[0097] In result (Table 6), sets of primers containing a specific 5-mer region, a 4- 
mer random base region, and a universal tag (Figure 7) were able to amplify the COX6b gene 
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from 100 ng of gDNA about 500-fold when the results from the 5 '-nuclease results were 
compared to 5 5 -nuclease results performed with 100 ng gDNA. 



Table 6: 5 '-nuclease Ct values using sets of primers, either universal-tagged primers or 
universal-tagged specific primers, as PCR primers. 



-A. V>X\ JL I IluCI 


dilution 


JL I CUXtlCU v-» t. 


F58 + R58 


11.9 




F58 + COX-Ra-Ul 


14.11 




COX-Fa-U2 + R58 


16.25 




COX-Fa-U2 + COX-Ra-Ul 


18 06 


18 


COX-Fb-U2 + R5 8 


16.61 




COX-Fb-U2 + COX-Ra-Ul 


19.72 


18 


U2-GGCTG + R5 8 


16.9 




U2-GGCTG + COX-Ra-Ul 


16.6 




F58 + Ul-GGAGA 


16.8 




COX-Fa-U2 + Ul-GGAGA 


16.3 


16 


U2-GGCTG + Ul-GGAGA 


17.7 


18 


100 ng gDNA 


17.7 




H 2 0 


40 




H 2 0 + COX-Ra-Ul 


40 





[0098] B. Further amplification of universal-tagged oligonucleotide primer- 
generated PCR products using universal primers 

[0099] PCR products generated using sets of universal-tagged specific primers, not 
containing a random region, were amplified using universal primers. An initial PCR reaction, 
containing a synthetic template (Table 7), was performed with two 9-mer primers, having a 
specific binding region, (5 '-CCTGGAGA and 5 '-GGTCACAGA; and tagged with either Ul (5'- 
AGTGGTGCCAGCTCAGCAG) (SEQ ID NO: 1) or U2 (CATGGAGTGTTGGCCAGGA (SEQ 
ID NO: 2) universal tags, respectively, to generate COX-Ra-Ul and COX-Fa-U2, respectively 
(Table 6). The PCR products were diluted 1000-fold and subjected to 5 '-nuclease analysis using 
universal primers, Ul and U2. 
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Table 7: Sequences of universal-tagged specific primers and universal primers 



Name 


Sequence (5'-3') 


Template 


GGTCACAGAcgggaagatctgaactggctgcatctttTCTCCCAGG (SEQ ID NO: 36) 


COX-Ra-Ul 


AGTGGTGCCAGCTCAGCAGCCTGGGAGA (SEQ ID NO: 29) 


COX-Fa-U2 


CATGGAGTGTTGGCCAGGAGGTCACAGA (SEQ ID NO: 27) 


Universal Ul 
Primer 


AGTGGTGCCAGCTCAGCAG (SEQ ID NO :1) 


Universal U2 
Primer 


CATGGAGTGTTGGCCAGGA (SEQ ID NO: 2) 


Taq-probe 


Fam-AGATGCAGCCAGTTCAGATCTTCCCG (SEQ ID NO: 34)-TMR 



[0100] For the initial PCR reaction, 2 ]uL of 1 pM template and 2 jxL of each of the 
10 pM universal-tagged specific primers, COX-Ra-Ul and COX-Fa-U2, were incubated with 2 
\xL of 10X buffer, 2 pL of 25 mM MgCl 2 , 2 pL of 2.5 mM dNTP, 1 jlxL of 5U/]aL Taq-Gold 
polymerase. The thermal profile for the PCR reaction was an initial incubation for 10 minutes at 
95°C, followed by 30 cycles of 30 seconds at 95°C, 1 minute at 50°C, and 1 minute at 72°C, a 
final cycle of 7 minutes at 72°C 5 and the reaction was stopped at 4°C. 

[0101] The PCR products were diluted 1000-fold and subjected to 5 '-nuclease 
analysis using the Ul and U2 universal primers described in Table 7. The reaction mixture 
subjected to 5'-nuclease analysis consisted of the following: 1.25 juL of 5 |iM 5'-nuclease probe 
(Table 6), 2.5 jllL of 1 \iM universal Ul primer, 2.5 pL of 1 juM universal U2 primer, 3.75 \xL 
ddH20, 12.5 |uL 2X Master mix containing enzyme and buffer, 25 \xL of the 1000-fold diluted 
PCR product that resulted from the COX-Fa-U2 and COX-Ra-Ul primed reaction described 
above. The thermal conditions for the 5 '-nuclease assay was a 2 minute incubation at 50°C, 
followed by a 10 minute incubation at 95°C, followed by 50 cycles of 15 seconds at 95°C and 1 
minute at 60°C. 

[0102] The results of the 5'-nuclease analysis are summarized in Table 8. The 
products generated using the universal-tagged specific primers, COX-Ra-Ul and COX-Fa~U2, 
were amplifiable using the universal Ul and U2 primers. 
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Table 8: 5 '-nuclease assay Ct values from the second-round PCR reactions performed with 
universal primers 



PCR primer pairs in Initial 
PCR Reaction 


Ct values 


Template 


COX-Ra-Ul and COX-Fa-U2 


16.7 


PCR products of template, 
1 000-fold dilution 


COX-Ra-Ul and COX-Fa-U2 


50 


No template (control) 



[0103] All references cited throughout the specification are hereby expressly 
incorporated by reference. It is understood that the application of the teachings of the present 
invention to a specific problem or situation will be within the capabilities of one having ordinary 
skill in the art in light of the teachings contained herein. Examples of the products and processes 
of the present invention should not be construed to limit the invention. 
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WHAT IS CLAIMED IS: 

1. A method for amplifying target DNA by polymerase chain reaction (PGR) 
comprising: 

preparing a mixture of (1) said target DNA, comprising multiple DNA sequences, (2) a 
set of single-stranded oligonucleotide primers, each primer comprising (i) a 3' specific region, (ii) 
a random region, and (iii) a 5' universal region, wherein said universal region can serve as a first 
universal priming site (Ul) for further PGR amplification with a complementary primer, (3) a 
DNA polymerase, and (4) multiple deoxynucleoside triphosphates (dNTPs) under conditions 
such that said PCR primers anneal to target DNA; and 

preparing a copy of said multiple DNA sequences in said target DNA, whereby an 
amplified product is formed incorporating said first universal priming site (Ul). 

2. The method of claim 1 wherein said target DNA is genomic DNA (gDNA). 

3. The method of claim 1 wherein said target DNA is complementary DNA 
(cDNA). 

4. The method of claim 2 wherein said gDNA is human. 

5. The method of claim 4 wherein at least a fraction of the whole human genome is 
amplified from said gDNA. 

6. The method of claim 5 wherein said 3' specific region is designed to bind to a 
genomic sequence occurring in the human genome with a frequency of between about 0.01% and 
2.0%. 

7. The method of claim 1 wherein said (i) 3' specific region is between about 4 and 
12 bases in length. 

8. The method of claim 7 wherein said 3 ! specific region is 5 bases in length. 

9. The method of claim 8 wherein the sequence of said 3' specific region is selected 
from the group consisting of: AAAAA; AAAAT; AAATA; TAAAA; AGAAA; ATAAA; 
AAATT; TTAAA; GAAAA; AATAA; CAAAA; AAGAA; AAAGA; AAAAG; AAATG; 
AAACA; ACAAA; ATTTA; TGAAA; and ATATA; AATAT; AAAAC; AGAGA; AATTA; 
AATTA; AGGAA; AACAA; CCCAG; TCAAA; ATTTC; AT A AT; ACACA; CAGAA; 
AGAAT; GAGAA; GG AAA; TCCCA; ATTAA; ATGAA; AAAGT; AATGA; TTCAA; 
ATTTG; TCTCA; ACAGA; AAACT; CCAGG; AGAAG; and TATAA. 

10. The method of claim 1 wherein said (ii) random region is between about 2 and 15 
bases in length. 

11. The method of claim 10 wherein said random region is 4 bases in length. 

12. The method of claim 11 wherein the random regions present in said primers 
include all 256 possible combinations of adenosine, cytosine, guanine, and thymidine. 
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13. The method of claim 1 wherein said (iii) 5' universal region is between about 12 
and 35 bases in length. 

14. The method of claim 13 wherein the sequence of the 5 5 universal region is 
selected from the group consisting of: 

AGTGGTGCCAGCTCAGCAG (SEQ ID NO: 1); 
CATGGAGTGTTGGCCAGGA (SEQ ID NO: 2); 
AACTCTCTCCCAAGAGCGA (SEQ ID NO: 3); 
TAGCGTAGTGAGCATCCGT (SEQ ID NO: 4); 
TCCCACATTCACCGCTTAG (SEQ ID NO: 5); 
TGTAAAACGACGGCCAGT (SEQ ID NO: 6); and 
GTACGCAGTCTACGAGGC (SEQ ID NO: 7). 

15. The method of claim 4 wherein said universal region is designed to have no 
significant homology to any segment in the human genome. 

16. The method of claim 1 further comprising the step of extending said copy to 
encode a second universal priming site (U2). 

17. The method of claim 16 comprising the step of further amplifying said amplified 
product with two primers complementing said first and second universal priming sites (Ul and 
U2). 

1 8. The method of claim 1 wherein said DNA polymerase is a Taq DNA polymerase. 

19. The method of claim 18 wherein said DNA polymerase is selected from the 
group consisting of AmpliTaq Gold™ DNA polymerase, AmpliTaq™ DNA polymerase, and the 
Stoffel fragment of AmpliTaq™ DNA polymerase. 

20. The method of claim 17 further comprising the step of single-nucleotide 
polymorphism (SNP) genotyping. 

21. The method of claim 1 wherein the set of oligonucleotide primers used are 
determined according to bioinformatic prediction of expected products using in silico e-PCR. 

22. The method of claim 1 or claim 17 further comprising the step of detecting the 
amplified product by subjecting the product to a 5 '-nuclease assay. 

23. The method of claim 22 wherein the detection comprises: 
annealing a fluorogenic oligonucleotide probe to the amplified DNA; 
cleaving the probe by the 5 '-nuclease activity of the DNA polymerase; and 
quantitating a change in fluorescence. 

24. A single stranded oligonucleotide primer for polymerase chain reaction (PGR), 
comprising (1) a 3' specific region of between about 4 and 12 bases in length, (2) a random region 
of between about 2 and 15 bases in length, and (iii) a 5' universal region of about 12 and 35 bases 
in length. 



27 



WO 03/097794 PCT/US03/14491 

25. The primer of claim 24 wherein said 3' specific region is 5 bases in length. 

26. The primer of claim 25 wherein the sequence of said 3' specific region is selected 
from the group consisting of: AAAAA; AAAAT; AAATA; TAAAA; AGAAA; ATAAA; 
AAATT; TTAAA; GAAAA; AATAA; CAAAA; AAGAA; AAAGA; AAAAG; AAATG; 
AAACA; ACAAA; ATTTA; TGAAA; and ATATA; AATAT; AAAAC; AGAGA; AATTA; 
AATTA; AGGAA; AACAA; CCCAG; TCAAA; ATTTC; ATAAT; ACACA; CAGAA; 
AGAAT; GAGAA; GGAAA; TCCCA; ATTAA; ATGAA; AAAGT; AATGA; TTCAA; 
ATTTG; TCTCA; ACAGA; AAACT; CCAGG; AGAAG; and TATAA. 

27. The primer of claim 24 wherein the 3 ! specific region is designed to bind a 
genomic sequence occurring in the human genome with a frequency of between about 0.01% and 
2.0 %. 

28. The primer of claim 24 wherein said random region is 4 bases in length. 

29. The primer of claim 28 wherein the sequence of said random region is selected 
from the group consisting of the 4 4 possible combinations of bases. 

30. The primer of claim 24 wherein said universal region is designed to have no 
significant homology to any segment in the human genome. 

31. A set of single-stranded oligonucleotide primers for amplification of genomic 
DNA (gDNA) in a polymerase chain reaction (PCR) 5 each of said primers comprising (1) a 3 1 
specific region of between about 4 and 12 bases in lengthy (2) a random region of between about 
2 and 15 bases in length, and (iii) a 5' universal region of about 12 and 3 bases in length. 

32. The set of claim 31 wherein in each primer the 3' specific region is 5 bases in 

length. 

33. The set of claim 32 wherein the sequence of the 3' specific region is selected 
from the group consisting of: AAAAA; AAAAT; AAATA; TAAAA; AGAAA; ATAAA; 
AAATT; TTAAA; GAAAA; AATAA; CAAAA; AAGAA; AAAGA; AAAAG; AAATG; 
AAACA; ACAAA; ATTTA; TGAAA; and ATATA; AATAT; AAAAC; AGAGA; AATTA; 
AATTA; AGGAA; AACAA; CCCAG; TCAAA; ATTTC; ATAAT; ACACA; CAGAA; 
AGAAT; GAGAA; GGAAA; TCCCA; ATTAA; ATGAA; AAAGT; AATGA; TTCAA; 
ATTTG; TCTCA; ACAGA; AAACT; CCAGG; AGAAG; and TATAA. 

34. The set of claim 31 wherein in each primer the random region is 4 bases in 

length. 

35. The set of claim 34 wherein the random regions present in said primers include 
all 4 4 possible combinations of adenosine, cytosine, guanine and thymidine. 

36. The set of claim 31 wherein the sequence of the 5 ! universal region is selected 
from the group consisting of: 

AGTGGTGCCAGCTCAGCAG (SEQ ID NO: 1); 
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CATGGAGTGTTGGCCAGGA (SEQ ID NO: 2); 
AACTCTCTCCCAAGAGCGA (SEQ ID NO: 3); 
TAGCGTAGTGAGCATCCGT (SEQ ID NO: 4); 
TCCCACATTCACCGCTTAG (SEQ ID NO: 5); 
TGTAAAACGACGGCCAGT (SEQ ID NO: 6); and 
GTACGCAGTCTACGAGGC (SEQ ID NO: 7). 

37. A primer-target duplex comprising the single stranded oligonucleotide primer of 
claim 24 and a target DNA wherein the binding region of the single stranded oligonucleotide 
primer is annealed to the target DNA. 

38. Use of a set of single-stranded oligonucleotide primers for the amplification of a 
target DNA, said primers comprising: (i) a 3' specific region, (ii) a random region, and (iii) a 5' 
universal region, wherein said universal region can serve as a first universal priming site (Ul) for 
further PCR amplification with a complementary primer. 

39. The use of claim 38, wherein said (iii) 5' universal region is between about 12 
and 35 bases in length. 

40. The use of claim 38, wherein the sequence of the 5' universal region is selected 
from the group consisting of: 

AGTGGTGCCAGCTCAGCAG (SEQ ID NO: 1); 
CATGGAGTGTTGGCCAGGA (SEQ ID NO: 2); 
AACTCTCTCCCAAGAGCGA (SEQ ID NO: 3); 
TAGCGTAGTGAGCATCCGT (SEQ ID NO: 4); 
TCCCACATTCACCGCTTAG (SEQ ID NO: 5); 
TGTAAAACGACGGCCAGT (SEQ ID NO: 6); and 
GTACGCAGTCTACGAGGC (SEQ ID NO: 7). 

41. The use of Claim 38, wherein said universal region is designed to have no 
significant homology to any segment in the human genome. 
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SEQUENCE LISTING 

<110> APPLERA CORPORATION 
Lao, Kai Qin 
Chen, Caifu 
Coehler, Ryan 
Scafe, Charles 
Schroth, Gary 

<120> UNIVERSAL-TAGGED OLIGONUCLEOTIDE PRIMERS AND METHODS OF USE 



<130> ABIOS.004VPC 
<160> 36 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 1 

agtggtgcca gctcagcag 19 

<210> 2 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 2 

catggagtgt tggccagga 19 

<210> 3 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 3 

aactctctcc caagagcga 19 

<210> 4 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 4 

tagcgtagtg agcatccgt 19 

<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 5 

tcccacattc accgcttag 19 

<210> 6 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 6 

tgtaaaacga cggccagt 18 

<210> 7 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer designed to have 
no significant homology to the human genome. 

<400> 7 

gtacgcagtc tacgaggc 18 

<210> 8 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide template having no 
significant homolgy to the human genome. 

<221> unsure 
<222> 1, 2, 3, 4 
<223> n = A, T, C or G 



<400> 8 
nnnnaaaaaa 

<210> 9 
<211> 10 
<212> DNA 



10 
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<220> 

<223> Synthetic oligonucleotide template having no 
significant horaolgy to the human genome. 

<221> unsure 
<222> 7, 8, 9, 10 
<223> n = A/ T, C or G 

<400> 9 

ttttttnnnn 10 

<210> 10 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide template having no 
significant homolgy to the human genome. 

<221> unsure 

<222> (0) ... (0) 

<223> n = A, T , C or G 

<400> 10 

tttttnnnnc ctagcgtagt gagcatccgt aagcattcat cgtgaggggt ctttgtcctc 60 
ctatggtagc agtcacgagg catnnnnaaa aaa 93 

<210> 11 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer with homology to 
a synthetic template having no significant homolgy 
to the human genome . 

<400> 11 

cctagcgtag tgagcatccg t 21 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer with homology to 
a synthetic template having no significant homolgy 
to the human genome . 

<400> 12 

atgcctcgtg actgctacca 20 

<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic oligonucleotide probe with homology to 
a synthetic template having no significant homolgy 
to the human genome. 

<400> 13 

aagcattcat cgtgaggggt ctttgtcctc eta 33 

<210> 14 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic oligonucleotide primer having no 
significant homolgy to the human genome. 

<221> unsure 
<222> 6, 7, 8, 9 
<223> n = A, T, C or G 

<221> misc_f eature 

<222> 1, 2, 3, 4, 5 

<223> A locked nucleic acid. 



<210> 15 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide primer having no 
significant homolgy to the human genome. 

<221> unsure 

<222> (0) ... (0) 

<223> n = A, T , C or G 

<221> misc_feature 
<222> 1, 2, 3, 4, 5, 6 
<223> A locked nucleic acid. 

<400> 15 

ttttttnnnn 10 

<210> 16 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A site in a ynthetic oligonucleotide template 

having no significant homolgy to the human genome. 



<400> 14 
tttttnnnn 



9 



<400> 16 



aaaaaaaaaa aaaa 



14 



<210> 17 
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<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer having homology 
with a synthetic oligonucleotide template having 
no significant homology to the human genome. 

<400> 17 

cctagcgtag tgagcatccg t 

<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer having homology 
with a synthetic oligonucleotide template having 
no significant homology to the human genome. 

<400> 18 

tggtagcagt cacgaggcat 

<210> 19 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide probe having homology 
with a synthetic oligonucleotide template having 
no significant homology to the human genome. 

<400> 19 

gaagtggcaa cagagaagga a 

<210> 20 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<400> 20 

atgcctcgta gactgcgtac gttttttttt 

<210> 21 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<400> 21 

atgcctcgta gactgcgtac gttttttttt t 
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<210> 22 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<2 2 1> misc_f eature 
<222> 26 

<223> A locked nucleic acid. 
<400> 22 

atgcctcgta gactgcgtac gttttttttt t 31 

<210> 23 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<221> misc_feature 
<222> 24, 26 

<223> A locked nucleic acid derivative. 
<400> 23 

atgcctcgta gactgcgtac gttttttttt t 31 

<210> 24 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<221> misc_feature 
<222> 24, 26, 28 

<223> A locked nucleic acid derivative. 
<400> 24 

atgcctcgta gactgcgtac gttttttttt t 31 

<210> 25 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<221> misc_f eature 
<222> 23, 24, 25, 26, 27 

<223> A locked nucleic acid derivative. 



6/9 



WO 03/097794 



PCT/US03/14491 



<400> 25 

atgcctcgta gactgcgtac gttttttttt t 



31 



<210> 26 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide template having no 
significant homology to the human genome. 

<400> 26 

cctagcgtag tgagcatccg tagaagtggc aacagagaag gaatggtagc agtcacgagg 60 
cataaaaaaa aaaaaaa 77 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer comprising 
human sequence. 



<210> 28 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer comprising 
human sequence. 

<400> 28 

catggagtgt tggccaggag ggatgagc 28 

<210> 29 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer comprising 
human sequence. 



<210> 30 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 



<400> 27 



catggagtgt tggccaggag gtcacaga 



28 



<400> 29 

agtggtgcca gctcagcagc ctgggaga 



28 



7/9 



WO 03/097794 



PCT/US03/14491 



<221> unsure 

<222> (0) . . . (0) 

<223> n = A, T, C or G 



<400> 30 

catggagtgt tggccaggan nnnggctg 



28 



<210> 31 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> A synthetic oligonucleotide primer having no 
significant homology to the human genome. 

<221> unsure 

<222> (0) ... (0) 

<223> n = A, T, C or G 

<400> 31 

agtggtgcca gctcagcagn nnnggaga 28 

<210> 32 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer comprising 
human sequence. 



<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide primer comprising 
human sequence. 

<400> 33 

tggaggacag aggaaaggga 20 

<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A synthetic oligonucleotide probe comprising human 
sequence . 



<400> 32 



agcaacgggc tgaaggc 



17 



<400> 34 

agatgcagcc agttcagatc ttcccg 



26 



<210> 35 
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<211> 434 

<212> DNA 

<213> Homo sapiens 



<400> 35 

ttgagctgca ggttgaatcc ggggtgcctt taggattcag caccatggcg gaagacatgg 60 

agaccaaaat caagaactac aagaccgccc cttttgacag ccgcttcccc aaccagaacc 120 

agactagaaa ctgctggcag aactacctgg acttccaccg ctgtcagaag gcaatgaccg 180 

ctaaaggagg cgatatctct gtgtgcgaat ggtaccagcg tgtgtaccag tccctctgcc 240 

ccacatcctg ggtcacagac tgggatgagc aacgggctga aggcacgttt cccgggaaga 300 

tctgaactgg ctgcatctcc ctttcctctg tcctccatcc ttctcccagg atggtgaagg 360 

gggacctggt acccagtgat ccccacccca ggatcctaaa tcatgactta cctgctaata 420 

aaaactcatt ggaa , 434 



<210> 36 
<211> 46 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> A synthetic oligonucleotide template having no 
significant homology with human sequence. 

<400> 36 

ggtcacagac gggaagatct gaactggctg catcttttct cccagg 4 6 
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